Prom time series to superstatistics 



School of Mathematical Sciences, 



Christian Beck 

^ueen Mary, University of London, Mile End Road, London El 4NS, UK 



in 
o 
o 



oo 



> 

a 

o 



o 
o 



X3 
O 

o 



X 



Ezechiel G.D. Cohen 
The Rockefeller University, 1230 York Avenue, New York, New York 10021, USA 

Harry L. Swinney 
Center for Nonlinear Dynamics and Department of Physics, 
University of Texas at Austin, Austin, Texas 78712, USA 
(Dated: February 2, 2008) 

Complex nonequilibrium systems are often effectively described by a 'statistics of a statistics', 
in short, a 'superstatistics'. We describe how to proceed from a given experimental time series 
to a superstatistical description. We argue that many experimental data fall into three different 
universality classes: x^-superstatistics (Tsallis statistics), inverse x^-superstatistics, and log-normal 
superstatistics. We discuss how to extract the two relevant well separated superstatistical time scales 
T and T, the probability density of the superstatistical parameter /3, and the correlation function 
for /3 from the experimental data. We illustrate our approach by applying it to velocity time series 
measured in turbulent Taylor-Couette flow, which is well described by log-normal superstatistics 
and exhibits clear time scale separation. 

PACS numbers: 05.40.-a, 42.27.Eq 



I. INTRODUCTION 

Driven nonequilibrium systems of sufficient complex- 
ity are often effectively described by a superposition of 
different dynamics on different time scales. As a simple 
example consider a Brownian particle moving through 
a changing environment. A relatively fast dynamics is 
given by the velocity of the Brownian particle, and a 
slow dynamics is given, e.g., by temperature changes of 
the environment. The two effects are associated with two 
well separated time scales, which result in a superposi- 
tion of two statistics, or in a short, a superstatistics (SS) 
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2Ct 1211 1221 |23j . The stationary distributions of supersta- 
tistical systems typically exhibit non-Gaussian behavior 
with fat tails, which can decay with a power law, or as 
a stretched exponential, or in an even more complicated 
way. An essential ingredient of SS models is the exis- 
tence of an intensive parameter (3 that fluctuates on a 
large spatio-temporal scale T. For the above example of 
a Brownian particle, (3 is the fluctuating inverse temper- 
ature of the environment, but in general (3 can also be 
an effective friction constant, a changing mass parame- 
ter, a changing amplitude of Gaussian white noise, the 
fluctuating energy dissipation in turbulent flows, or sim- 
ply a local variance parameter extracted from a signal. 
The SS concept is quite general and has recently been 
applied to a variety of physical systems, including La- 
grangian 0, 0, lial and Eulerian [T^ ll^ turbulence, 
defect turbulence Il5l. a tmospheric turbulence (l6j, 0| , 
cosmic ray statistics [13, solar flares [l9l. r andom net- 
works I20II21II and mathematical finance|2^ l23| . 

In this paper we address a problem that is of great 
interest in experimental applications. Given an exper- 



imentally measured time series or signal, how can we 
check if this time series is well described by a superstatis- 
tical model, i.e., does it contain two separate time scales, 
and how can we extract the relevant superstatistical pa- 
rameters from the time series? Further, since there are 
infinitely many SS 0, which ones are the most relevant 
for typical experimental situations? 

We argue that many experimental data (see, e.g., 
[Hill 13, 16, 24, 25, 26]) are well described by three ma- 
jor universality classes, namely x^, inverse x^, and log- 
normal SS. These SS represent a universal limit statistics 
for large classes of dynamical systems. We then show 
how to extract the superstatistical parameters from a 
given experimental signal. Our example is a time series 
of longitudinal velocity differences measured in turbulent 
Taylor-Couette flow [23|. We will extract the two rele- 
vant time scales r and T from the data and show that 
there is clear time scale separation for our example. We 
will also investigate the probability density of (3 and the 
/3-correlation function. While our turbulent time series 
appears to fall into the universality class of log-normal 
superstatistics, our concepts are general and can in prin- 
ciple be applied to any time series. 



II. SUPERSTATISTICAL UNIVERSALITY 
CLASSES 

Consider a driven nonequilibrium system which is in- 
homogeneous and consists of many spatial cells with dif- 
ferent values of some intensive parameter (3 (e.g., the 
inverse temperature) . The cell size can be determined by 
the correlation length of the continuously varying /3-field. 
Each cell is assumed to reach local equilibrium very fast, 
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i.e., the associated relaxation time r is short. The param- 
eter (3 in each cell is approximately constant during the 
time scale T, then it changes to a new value. In the long- 
term run (t >> T), the stationary distributions of this 
inhomogcncous system arise as a superposition of Boltz- 
mann factors e~^^ weighted with the probability density 
/(/3) to observe some value /3 in a randomly chosen cell: 



P{E) = / 
Jo 
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(1) 



Here E is an effective energy for each cell, p{E) is the 
density of states, and Z(f3) is the normalization constant 
of p{E)e~'^^ for a given /3. The simplest example is a 
Brownian particle of mass m moving through a changing 
environment in d dimensions. For its velocity v one has 
the local Langevin equation 



V = —jv + (jL{t) 



(2) 



{L(t): d-dimensional Gaussian white noise) which be- 
comes superstatistical because for a fluctuating environ- 
ment the parameter P '■= -^-^ becomes a random vari- 
able as well: it varies from cell to cell on the large spatio- 
temporal scale T. Of course, for this example E = ^mip, 
and while on the time scale T the local stationary distri- 
bution in each cell is Gaussian, 
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the marginal distribution describing the long-time behav- 
ior of the particle for t >> T, 



p{v) 



f{P)p{v\P)d(3 



(4) 



exhibits non-trivial behavior. The large-|u| tails of the 
distribution Q depend on the behavior of for (3^0 
js]. For example, if is a x^-distribution of degree n, 
then eq. Q generates Tsallis statistics [2^ l29l | , with en- 
tropic index q given by g = 1 -I- ^^-^^ |3- Of course, a 
necessary condition for a superstatistical description to 
make sense is the condition t = 7"^ << T, because oth- 
erwise the system is not able to reach local equilibrium 
before the next change of (3 takes place. In superstatis- 
tical turbulence models Wl, flol 0, Il2l 0| , one formally 
replaces the variable v in eq. (jSjl by the velocity difference 
u (or acceleration a on smallest scales) , and f3 is related 
to energy dissipation e. 

The distribution /(/3) is determined by the spatio- 
temporal dynamics of the entire system under consider- 
ation. By construction, /? is positive, so /(/?) cannot be 
Gaussian. Let us here consider three examples of what 
to expect in typical experimental situations. 

(a) There may be many (nearly) independent micro- 
scopic random variables ^j, j = 1,...,J, contributing 
to P in an additive way. For large J their rescaled sum 
=1 ^ill approach a Gaussian random variable 



Xi due to the Central Limit Theorem. In total, there 
can be many different random variables consisting of mi- 
croscopic random variables, i.e., we have n Gaussian ran- 
dom variables Xi , . . . , Xn due to various relevant degrees 
of freedom in the system. As mentioned before, [3 needs 
to be positive; a positive [3 is obtained by squaring these 
Gaussian random variables. The resulting (3 = X]"=i -^I 
is x^-distributed with degree n. 
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where /3o is the average of (3. As shown in |^ Isoll. the SS 
resulting from Q and (0) is Tsallis statistics [23. It ex- 
hibits power-law tails for large \v\. Our above argument 
shows that Tsallis statistics arises as a universal limit dy- 
namics, i.e., the details of the microscopic random vari- 
ables (e.g., their probability densities) are irrelevant. 

(b) The same considerations as above can be applied if 
the 'temperature' /3~^ rather than 13 itself is the sum of 
several squared Gaussian random variables arising out of 
many microscopic degrees of freedom f^. The resulting 
f{(3) is the inverse x^-distribution given by 



/(/3) 



/3o 

r(§) 
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n/2 



/3 



-n/2-2^-.^ 
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It generates superstatistical distributions I^J that have 
exponential decays in \v\ Again this superstatis- 

tics is universal: details of the are irrelevant. 

(c) Instead of (3 being a sum of many contributions, for 
other systems (in particular, turbulent ones) the random 
variable (3 may be generated by multiplicative random 
processes. We may have a local cascade random variable 
Xi = 11/= 1 Ci' where J is the number of cascade steps 
and the are positive microscopic random variables. By 
the Central Limit Theorem, for large J the random vari- 
able ^logXi = "^S/=ilogO becomes Gaussian for 
large J. Hence Xi is log-normally distributed. In gen- 
eral there may be n such product contributions to (3, i.e., 
j3 = nr=i^i- Then log/3 = J2i=i^'^sXi is a sum of 
Gaussian random variables; hence it is Gaussian as well. 
Thus (3 is log-normally distributed, i.e.. 
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where /x and are suitable mean and variance pa- 
rameters n\. For related turbulence models, see, e.g., 
,1(1 .111 I25I 3H . Again this log-normal result is indepen- 
dent of the details of the microscopic cascade random 
variables ^j; hence log-normal SS is universal as well. 

Although more complicated cases can be constructed, 
we believe that most experimentally relevant cases fall 
into one of these three universality classes, or simple com- 
binations of them, x^ superstatistics has been found for 
wind velocity fluctuations j3 and log-normal su- 
perstatistics has been found for Lagrangian 0, 0, ^| 
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and Eulerian [T^ IT^ turbulence. Candidate systems for 
inverse superstatistics are systems exhibiting velocity 
distributions with exponential tails 0, 0| . 



III. APPLICATION TO EXPERIMENTAL TIME 
SERIES 

Suppose some experimental time series u(t) is given 
[3^. Our goal is to test the hypothesis that it is due to 
a superstatistics and if so, to extract the two basic time 
scales r and T as well as /(/3). First let us determine 
the large time scale T. For this we divide the time series 
into N equal time intervals of size At. The total length 
of the signal is tmax — NAt. We then define a function 
K{At) by 



K{At) 



.-At 



dti 



At 



(8) 



Here 



)to,At ~ Ito~^^* ' ' ' denotes an integration 
over an interval of length At starting at to, and u is 
the average of u{t) (we may either choose u to be a local 
average in each cell or a global average over the entire 
time series — our results do not depend on this choice in 
a significant way). Equation ((SJ simply means that the 
local flatness is evaluated in each interval of length At, 
and the result is then averaged over all to- We now define 
the superstatistical time scale T by the condition 



k{T) = 3. 



(9) 



Clearly this condition simply implies that we are looking 
for the simplest SS, a superposition of local Gaussians, 
which have local flatness 3 (see |^ for similar ideas). 
If At is so small that only one constant value of u is 
observed in each interval, then of course K{At) — 1. On 
the other hand, if At is so large that it includes the entire 
time series, then we obtain the flatness of the distribution 
of the entire signal, which will be larger than 3, since 
superstatistical distributions are fat-tailed. Hence there 
exists a time scale T satisfying @. 

The function /t(Ai) is shown in Fig. 1 for longitu- 
dinal velocity differences, u{t) = v{t + 6) — v{t), mea- 
sured in Taylor-Couette flow at Reynolds number Re — 
540000 |23|. The total number of measurement points 
was 2 X 10^, and in the present analysis At < 1000, so 

> 2 X 10'*, which means there is sufficient statistics to 
obtain precise values for the time scales T and r. For de- 
tails of the experiment, see j27j. For each time difference 
d (measured in units of the sampling frequency, which 
was 2500 times the inner cylinder rotation frequency), 
the relevant superstatistical time scale T leading to lo- 
cally Gaussian behavior is extracted in Fig. 1. The time 
scales T have to be compared with the relaxation times 
T = 7^* of the dynamics, which can be estimated from 
the short-time exponential decay of the correlation func- 
tion Cu{t — t') = {u{t)u{t')) of the velocity difference u 
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FIG. 1: Determination of the superstatistics long time scale 
T from the flatness function ^(At) given in eq. (8), for 5 = 
2-' ,j = 0, 1, 2, . . . , 7 (from top to bottom). The intersections 
with the line k = 3 yield T = 39, 42, 58, 100, 184, 320, 600, 948, 
respectively. Time is nondimensional (see text). 




FIG. 2: Determination of the superstatistics short time scale t 
from the decay of the correlation function Cu (t) of the velocity 
difference. Defining r by Cu(r) = e~^C„(0), we obtain for 5 = 
2^j = 0, 1,2,--- ,7, r = 2.1,2.3,2.8,4.3,7.2,12.1,19.9,29.5, 
respectively. 



(Fig. 2). We find that the ratio T/r 17.. .34 is large 
compared to unity, and the ratio has only a weak (loga- 
rithmic) dependence on S (Fig. 3). Thus there are indeed 
two well separated time scales in the time series for tur- 
bulent Couette- Taylor flow. 

Laboratory data were obtained for a wide range of 
Reynolds numbers, so we can also examine how the time 
scale ratio changes with the Reynolds number Re. Fig. 4 
shows that T/r increases with increasing Re, meaning 
that the superstatistics approach becomes more and more 
exact for Re — > 00. 

Next, we are interested in the analysis of the slowly 
varying stochastic process P{t). Since the variance of 
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(a) 
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'experiment' 




FIG. 3; The time scale ratio T/t, given as a function of 5 for 
turbulence data at Re = 540000, is large compared to unity. 
Thus the long and short time scales are well-separated, as 
required for superstatistics. The dashed line is a fit given by 
r/r = 17 + 3 In 5. 



(b) 



24 



22 



20 



16 



14 



12 







1 1 


1 




.0"' 















































100000 200000 300000 400000 500000 600000 
Re 

FIG. 4: Time scale ratio T/t as a function of Reynolds num- 
ber Re for 5 = 8, 4, 2, 1 (from top to bottom). 



the local Gaussians y ^''"^ is given by /3 
determine the process [3{t) from the time series as 

/3(io) ^ 



\2 ■ 

/to,T 



we can 
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We obtain the probabihty density /(/3) as a histogram of 
/3(to) for all values of as shown in Fig. 5. We compare 
the experimental data with log-normal, and inverse 
distributions with the same mean {(3) and variance 
(/3^) — as the experimental data. The fit of the data 
to a log-normal distribution is significantly better than to 
a or inverse x^ ■ Indeed, the cascade picture of energy 
dissipation in turbulent flows suggests that our time se- 
ries should belong to the log-normal universality class of 
superstatistics (see section II (c)). A power-law relation 
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FIG. 5: Probability density /(/3) extracted from the turbulent 
time series (5 = 16), and compared with log-normal, x^, and 
inverse distributions, on (a) linear-linear and (b) log-log 
plots. All distributions have the same mean and variance as 
the experimental data. 



between the energy dissipation rate e and (3 was found for 
the Couette- Taylor turbulence data in an analysis in 1^ . 
Note that if such a power-law relation is valid, then a log- 
normally distributed e implies a log-normally distributed 
/?, and vice versa. 

For superstatistics to make physical sense, the variable 
[3 must change slowly compared to u. This is indeed 
the case for our turbulence data, as Fig. 6 illustrates 
for a sample time series. A slow /3-dynamics also im- 
plies a slow correlation decay of the /?-correlation func- 
tion Ci3{t — t') — {i3{t)(3{t')) . For our data we observe 
a power-law decay with a small exponent, Cp{i) ~ t~'^ '^ 
(Fig. 7). This means that (3{t) indeed has a long memory 
and changes slowly, a necessary consistency condition for 
the superstatistics approach. For comparison, the cor- 
relation function of the longitudinal velocity difference 
u{t) first decays exponentially fast and only finally, for 
very large times, approaches a power-law decay of the 
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FIG. 6; Time series of (3(1) (top) and u{t) (bottom). For this 
example 5 — 2 and T = 42. 
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FIG. 7; Correlation functions C'i3{t) (top) and |Cu(t)! (bot- 
tom) for 5 — 1. The straight lines represent power laws with 
exponents -0.9 and -1.8. 



form Cu{t) ^ t ^-^j as shown in Fig. 7. Note that for 
t > 6 the /3-correlation is larger than the u-correlation 
by a factor 10. ..100. 

Finally, we may check the validity of the general SS 
formula 



f{l3)p{u\P)dp, 



(11) 



where p{u\f3) is the conditional distribution of the signal 
u(t) in cells of size T, and p{u) is the marginal stationary 
distribution of the entire signal. For log-normal super- 
statistics this means 



p{u) 



1 

27rs 



-1/2 



exp 
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(12) 

As shown in Fig. 8, there is excellent agreement be- 
tween the experimental histogram and the superstatis- 




FIG. 8: Comparison of the measured (fluctuating lines) and 
predicted (dashed lines) probability distribution p{u) for ve- 
locity differences on (a) semi- log plot, which emphasizes the 
tails, and (b) linear-linear plot, which emphasizes the peak. 
The predicted p{u) was obtained from eq. 1121 with /(/9) be- 
ing a log-normal distribution with the same parameters as in 
Fig. 5 (5 = 16). 



tical model prediction, both in the tails and in the region 
around the peak of the distribution. 

For any superstatistics one can formally define a pa- 



rameter q by 



pe] 
1 



(13) 



q measures in a quantitative way the deviation from 
Gaussian statistics. No fluctuations in /3 at all corre- 
spond to /(/?) — S{/3 — (3o) and q = I, i.e., ordinary 
equilibrium statistical mechanics. For x^-superstatistics, 
the above q is given hy q = 1 + 2/n 1, 30] and is nearly 
the same as the entropic index qt introduced by Tsallis 
11 El El (qr = 1 + 2/(n -I- 1)). For log-normal (LN) 
superstatistics, one can relate q to the flatness F — -^^jjyi 
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FIG. 9: The parameter q (eq. (13)) as a function of S, as eval- 
uated from eq. H13|l (q-fluctuations) and eq. p8|l (q-flatness). 

of the distribution p{u) . Since for log- normal superstatis- 
tics H 

{f^)LN = f^e^'^ (14) 
{P^)ln - M'e^^' (15) 
(u^) ^ fi-^e^'' (16) 
(u^) = Sf,'^e'^\ (17) 

where /i and are the mean and variance parameters 
of the distribution 0, we arrive at the foUowing simple 
relation, 

q^e^' ^ (18) 

using eq. (|13|l . We thus have two different equations to 
evaluate q for our time series. The first one, based on 
eq. 1)13(1 . is always valid (i.e., for any /(/?)), whereas the 
second one, based on eq. H18|l . should coincide with the 
first one provided the system is described by log-normal 
superstatistics. Figure 9 shows the g-values that we ex- 
tract from the experimental data for Re = 540000 using 



both methods. As expected, q decreases monotonically 
with scale 6. Both curves agree well for S > 2. This 
indicates that log-normal superstatistics is a good model 
for our data, and that our extraction of the time scale 
T of the process (3{t) is consistent. Significant deviations 
between the two g- values occur only on the smallest scale 
(5 = 1, where the experiment reaches its resolution limits. 



IV. CONCLUSIONS 

In this paper we have advocated the view that the 
non-Gaussian behavior of many complex driven nonequi- 
librium systems can often be understood as a superpo- 
sition of two different statistics on different time scales, 
in short, a superstatistics. We have argued that typi- 
cal experimental situations are described by three rel- 
evant universality classes, namely x^, inverse x^i Eind 
log-normal superstatistics. Our example, turbulent Tay- 
lor Couette flow, falls into the universality class of log- 
normal superstatistics. This means the time series is 
essentially described by local Boltzmann factors e~2^" 
whose variance parameter f3 varies slowly according to 
a log-normal distribution function. We have developed 
a general method to extract from data the process f3{t), 
its probability density /(/3), and its correlation function. 
Our approach is applicable to any experimental time se- 
ries. We have extracted the two relevant time scales r 
(the relaxation time to local equilibrium) and T (the large 
time scale on which the intensive parameter P fluctuates). 
Our main result is that for turbulent Taylor-Couette flow 
there is clear time scale separation, which is a necessary 
condition for a superstatistical description to make phys- 
ical sense. The ratio T/r grows logarithmically with the 
scale separation S on which longitudinal velocity differ- 
ences are investigated. Moreover, T/t also increases with 
increasing Reynolds number, making the superstatistical 
approach more and more exact for increasing Reynolds 
number. The experimentally measured distributions of 
/3 and u agree very well with the superstatistical model 
predictions. 
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